SPARQL Benchmarking with Automatically Generated OLAP Queries
نویسندگان
چکیده
The growing use of data analytics on Linked Data requires SPARQL engines to efficiently execute Online Analytical Processing (OLAP) queries. While SPARQL 1.1 provides appropriate basic constructs, corresponding optimization of SPARQL engines is still in its infancy and further development lacks benchmarks that mimick the data distributions found in Link Data. In fact, existing work on OLAP benchmarking for SPARQL has usually adopted queries and data from relational databases, which may not well represent Linked Data. We map typical OLAP operations to SPARQL and propose a tool named ASPG to automatically generate OLAP queries from real-world Linked Data, which can be used to construct analytic benchmarks for SPARQL engines. We present such a benchmark called DBOB as an example which consists of queries generated using DBpedia. We apply this benchmark to a replication of DBpedia and present the result.
منابع مشابه
No Size Fits All - Running the Star Schema Benchmark with SPARQL and RDF Aggregate Views
Statistics published as Linked Data promise efficient extraction, transformation and loading (ETL) into a database for decision support. The predominant way to implement analytical query capabilities in industry are specialised engines that translate OLAP queries to SQL queries on a relational database using a star schema (ROLAP). A more direct approach than ROLAP is to load Statistical Linked ...
متن کاملQuerying Semantic Web Data Cubes
We address the problem of querying data cubes for Online Analytical Processing (OLAP) analysis, directly on the Semantic Web (SW). We rst introduce CQL, a simple algebra for querying data cubes at a conceptual level. Taking advantage of QB4OLAP metadata, we automatically translate CQL queries into SPARQL ones, and propose query optimization strategies that adapt, to the particular OLAP setting,...
متن کاملInteracting with Statistical Linked Data via OLAP Operations
Online Analytical Processing (OLAP) promises an interface to analyse Linked Data containing statistics going beyond other interaction paradigms such as follow-your-nose browsers, faceted-search interfaces and query builders. Transforming statistical Linked Data into a star schema to populate a relational database and applying a common OLAP engine do not allow to optimise OLAP queries on RDF or ...
متن کاملSM4MQ: A Semantic Model for Multidimensional Queries
On-Line Analytical Processing (OLAP) is a data analysis approach to support decision-making. On top of that, Exploratory OLAP is a novel initiative for the convergence of OLAP and the Semantic Web (SW) that enables the use of OLAP techniques on SW data. Moreover, OLAP approaches exploit different metadata artifacts (e.g., queries) to assist users with the analysis. However, modeling and sharing...
متن کاملFEASIBLE: A Featured-Based SPARQL Benchmark Generation Framework
Benchmarking is indispensable when aiming to assess technologies with respect to their suitability for given tasks. While several benchmarks and benchmark generation frameworks have been developed to evaluate triple stores, they mostly provide a one-fits-all solution to the benchmarking problem. This approach to benchmarking is however unsuitable to evaluate the performance of a triple store fo...
متن کامل